Assessing Quality of Unsupervised Topics in Song Lyrics

Authors

  • Lucas Sterckx
  • Thomas Demeester
  • Johannes Deleu
  • Laurent Mertens
  • Chris Develder

Abstract

How useful are topic models based on song lyrics for applications in music information retrieval? Unsupervised topic models on text corpora are often difficult to interpret. Based on a large collection of lyrics, we investigate how well automatically generated topics are related to manual topic annotations. We propose to use the kurtosis metric to align unsupervised topics with a reference model of supervised topics. This metric is well-suited for topic assessments, as it turns out to be more strongly correlated with manual topic quality scores than existing measures for semantic coherence. We also show how it can be used for a detailed graphical topic quality assessment.
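The abstract does not spell out how the kurtosis metric is computed. As a minimal sketch, assume each topic is a word-weight vector over a shared vocabulary, that an unsupervised topic is compared with every supervised reference topic by cosine similarity, and that the excess kurtosis of those similarities is taken as the score. The function names, the choice of cosine similarity, and the toy vectors are illustrative assumptions, not the authors' implementation.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length weight vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def excess_kurtosis(values):
    """Population excess kurtosis: m4 / m2**2 - 3 (0.0 if there is no variance)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n
    m4 = sum((x - mean) ** 4 for x in values) / n
    return m4 / (m2 * m2) - 3.0 if m2 > 0 else 0.0


def alignment_kurtosis(topic, reference_topics):
    """Kurtosis of the similarities between one topic and all reference topics.

    Assumption: a topic that matches one reference topic much better than
    the rest gives a peaked similarity profile and so a higher score.
    """
    similarities = [cosine(topic, ref) for ref in reference_topics]
    return excess_kurtosis(similarities)


# Toy data (hypothetical): six reference topics, each a one-hot vector
# over a six-word vocabulary.
reference = [[1.0 if i == j else 0.0 for j in range(6)] for i in range(6)]

focused = alignment_kurtosis([1, 0, 0, 0, 0, 0], reference)  # matches one reference
diffuse = alignment_kurtosis([1, 1, 1, 0, 0, 0], reference)  # spread over three
```

Under these assumptions the focused topic scores higher than the diffuse one, which is the behaviour a peakedness measure would need in order to flag topics that align with a single annotated category.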

Similar resources

LyricsRadar: A Lyrics Retrieval System Based on Latent Topics of Lyrics

This paper presents a lyrics retrieval system called LyricsRadar that enables users to interactively browse song lyrics by visualizing their topics. Since conventional lyrics retrieval systems are based on simple word search, those systems often fail to reflect user’s intention behind a query when a word given as a query can be used in different contexts. For example, the word “tears” can appear ...


Mining Sentiments from Songs Using Latent Dirichlet Allocation

Song-selection and mood are interdependent. If we capture a song’s sentiment, we can determine the mood of the listener, which can serve as a basis for recommendation systems. Songs are generally classified according to genres, which don’t entirely reflect sentiments. Thus, we require an unsupervised scheme to mine them. Sentiments are classified into either two (positive/negative) or multiple ...


Lyric Jumper: A Lyrics-Based Music Exploratory Web Service by Modeling Lyrics Generative Process

Each artist has their own taste for topics of lyrics such as “love” and “friendship.” Considering such artist’s taste brings new applications in music information retrieval: choosing an artist based on topics of lyrics and finding unfamiliar artists who have similar taste to a favorite artist. Although previous studies applied latent Dirichlet allocation (LDA) to lyrics to analyze topics, LDA w...


Addendum to “Multiple Lyrics Alignment: Automatic Retrieval of Song Lyrics” Technical Report

The purpose of this technical report is to discuss two additional aspects of automatic lyrics retrieval as described in “Multiple Lyrics Alignment: Automatic Retrieval of Song Lyrics” by Knees et al., 2005. The first aspect is the introduction of a confidence measure to estimate the quality of the generated output. The second aspect deals with the automatic formatting of generated lyrics to pre...


Automatic Prediction of Hit Songs

Keywords: hit song detection, music classification

We explore the automatic analysis of music to identify likely hit songs. We extract both acoustic and lyric information from each song and separate hits from non-hits using standard classifiers, specifically Support Vector Machines and boosting classifiers. Our features are based on global sounds learnt in an unsupervised fashion from acoustic data or gl...


Publication date: 2014